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00 ! Abstract 

Q ■ 

We introduce a new family of codes, termed weighted superimposed codes (WSCs). This family generalizes 
the class of Euclidean superimposed codes (ESCs), used in multiuser identification systems. WSCs allow for 
discriminating all bounded, integer-valued linear combinations of real-valued codewords that satisfy prescribed norm 
and non-negativity constraints. By design, WSCs are inherently noise tolerant. Therefore, these codes can be seen as 
| special instances of robust compressed sensing schemes. The main results of the paper are lower and upper bounds 

on the largest achievable code rates of several classes of WSCs. These bounds suggest that with the codeword and 
weighting vector constraints at hand, one can improve the code rates achievable by standard compressive sensing. 

, I. Introduction 

Superimposed codes (SCs) and designs were introduced by Kautz and Singleton [1], for the purpose of studying 
^ [ database retrieval and group testing problems. In their original formulation, superimposed designs were defined 
00 ! as arrays of binary codewords with the property that bitwise OR functions of all sufficiently small collections of 
^ ', codewords are distinguishable. Superimposed designs can therefore be viewed as binary "parity-check" matrices 
for which syndromes represent bitwise OR, rather than XOR, functions of selected sets of columns. 

The notion of binary superimposed codes was further generalized by prescribing a distance constraint on the 
O ■ OR evaluations of subsets of columns, and by extending the fields in which the codeword symbols lie [2]. In the 
j> . latter case, Ericson and Gyorfi introduced Euclidean superimposed codes (ESCs), for which the symbol field is 

•i-H , 

R, for which the OR function is replaced by real addition, and for which all sums of less than K codewords 
are required to have pairwise Euclidean distance at least d. The best known upper bound on the size of Euclidean 
superimposed codes was derived by Fiiredi and Ruszinko [3], who used a combination of sphere packing arguments 
and probabilistic concentration formulas to prove their result. 

On the other hand, compressed sensing (CS) is a new sampling method usually applied to K-sparse signals, i.e. 
signals embedded in an iV-dimensional space that can be represented by only K <C N significant coefficients [4]- 
[6]. Alternatively, when the signal is projected onto a properly chosen basis of the transform space, its accurate 
representation relies only on a small number of coefficients. Encoding of a K-sparse discrete-time signal x of 
dimension N is accomplished by computing a measurement vector y that consists of m < iV linear projections, 
i.e. y = <hx. Here, <3? represents anmxAf matrix, usually over the field of real numbers. Consequently, the measured 
vector represents a linear combination of columns of the matrix <3?, with weights prescribed by the nonzero entries 
of the vector x. Although the reconstruction of the signal x G R from the (possibly noisy) projections is an 
ill-posed problem, the prior knowledge of signal sparsity allows for accurate recovery of x. 

*This work is supported by the NSF Grant CCF 0644427, the NSF Career Award, and the DARPA Young Faculty Award of the second 
author. Parts of the results were presented at the CCIS'2008 ant ITW'2008 conferences. 
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The connection between error-correcting coding theory and compressed sensing was investigated by Candes 
and Tao in [7], and remarked upon in [8]. In the former work, the authors studied random codes over the real 
numbers, the noisy observations of which can be decoded using linear programming techniques. As with the case 
of compressed sensing, the performance guarantees of this coding scheme are probabilistic, and the if-sparse signal 
is assumed to he in H. . 

We propose to study a new class of codes, termed weighted superimposed codes (WSCs), which provide a link 
between SCs and CS matrices. As with the case of the former two entities, WSCs are defined over the field of real 
numbers. But unlike ESCs, for which the sparse signal x consists of zeros and ones only, and unlike CS, for which 
x is assumed to belong to R , in WSCs the vector x is drawn from B N , where B denotes a bounded, symmetric 
set of integers. The motivation for studying WSCs comes from the fact that in many applications, the alphabet of 
the sparse signal can be modeled as a finite set of integers. 

Codewords from the family of WSCs can be designed to obey prescribed norm and non-negativity constraints. 
The restriction of the weighting coefficients to a bounded set of integers ensures reconstruction robustness in the 
presence of noise - i.e., all weighted sums of at most K codewords can be chosen at "minimum distance" d from 
each other. This minimum distance property provides deterministic performance guarantees, which CS techniques 
usually lack. Another benefit of the input alphabet restriction is the potential to reduce the decoding complexity 
compared to that of CS reconstruction techniques. This research problem was addressed by the authors in [9]— [1 1], 
but is beyond the scope of this paper. 

The central problem of this paper is to characterize the rate region for which a WSC with certain parameters exists. 
The main results of this work include generalized sphere packing upper bounds and random coding lower bounds 
on the rates of several WSC families. The upper and lower bounds differ only by a constant, and therefore imply 
that the superposition constraints are ensured whenever m = O (K log N / log K) . In the language of CS theory, 
this result suggests that the number of required signal measurements is less than the standard O (K log (N/K)), 
required for discriminating real-valued linear combinations of codewords. This reduction in the required number 
of measurements (codelength) can be seen as a result of restricting the input alphabet of the sparse signal. 

The paper is organized as follows. Section [III] introduces the relevant terminology and definitions. Section [TV] 
contains the main results of the paper - upper and lower bounds on the size of WSCs. The proofs of the rate 
bounds are presented in Sections [V] and I VIII I Concluding remarks are given in Section [DO 

II. Motivating Applications 

We describe next two applications - one arising in wireless communication, the other in bioengineering - 
motivating the study of WSCs. 

The adder channel and signature codes: One common application of ESCs is for signaling over multi-access 
channels. For a given set of k < K active users in the channel, the input to the receiver y equals the sum of 
the signals (signatures) x^ , j = 1, . . . , k, of the k active users, i.e. y = ^j =1 Xi r The signatures are only used 
for identification purposes, and in order to minimize energy consumption, all users are assigned unit energy [2], 
[3]. Now, consider the case that in addition to identifying their presence, active users also have to convey some 
limited information to the receiver by adapting their transmission power. The received signal can in this case be 
represented by a weighted sum of the signatures of active users, i.e., y = Ylj=i \/P~iJ :x -i r The codebook used in 
this scheme represents a special form of WSC, termed Weighted Euclidean Superimposed Codes (WESCs); these 
codes are formally defined in Section [III] 

Compressive sensing microarrays: A microarray is a bioengineering device used for measuring the level of certain 
molecules, such as RNA (ribonucleic acid) sequences, representing the joint expression profile of thousands of genes. 
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A microarray consist of thousands of microscopic spots of DNA sequences, called probes. The complementary DNA 
(cDNA) sequences of RNA molecules being measured are labeled with fluorescent tags, and such units are termed 
targets. If a target sequence has a significant homology with a probe sequence on the microarray, the target cDNA 
and probe DNA molecules will bind or "hybridize" so as to form a stable structure. As a result, upon exposure to 
laser light of the appropriate wavelength, the microarray spots with large hybridization activity will be illuminated. 
The specific illumination pattern and intensities of microarray spots can be used to infer the concentration of RNA 
molecules. In traditional microarray design, each spot of probes is a unique identifier of only one target molecule. 
In our recent work [12], [13], we proposed the concept of compressive sensing microarrays (CSM), for which each 
probe has the potential to hybridize with several different targets. It uses the observation that, although the number 
of potential target RNA types is large, not all of them are expected to be present in a significant concentration at 
all observed times. 

Mathematically, a microarray is represented by a measurement matrix, with an entry in the i th row and the j th 
column corresponding to the hybridization probability between the i th probe and the j th target. In this case, all the 
entries in the measurement matrix are nonnegative real numbers, and all the columns of the measurement matrix are 
expected to have /i-norms equal to one. In microarray experiments, the input vector x has entries that correspond to 
integer multiples of the smallest detectable concentration of target cDNA molecules. Since the number of different 
target cDNA types in a typical test sample is small compared to the number of all potential types, one can assume 
that the vector x is sparse. Furthermore, the number of RNA molecules in a cell at any point in time is upper 
bounded due to energy constraints, and due to intracellular space limitations. Hence, the integer-valued entries of 
x are assumed to have bounded magnitudes and to be relatively small compared to the number of different RNA 
types. With the above considerations, the measurement matrix of a CSM can be described by nonnegative Zi-WSCs, 
formally defined in Section |III] 

III. Definitions and Terminology 
Throughout the paper, we use the following notation and definitions. 

A code C is a finite set of codewords (vectors) v, G R mxl , i = 1, 2, • • • ,N. The code C is specified by its 
codeword matrix (codebook) C G W mxN , obtained by arranging the codewords in columns of the matrix. 
For two given positive integers, t and K, let 

B t = [-t, t] = {-t, -t + 1, • • • , t - l,t} C Z 

be a symmetric, bounded set of integers, and let 

B K = {b€Bf r : ||b|| <iT} 

denote the Iq ball of radius K, with ||b|| representing the number of nonzero components in the vector b (i.e., 
the support size of the vector). We formally define WESCs as follows. 

Definition 1: A code C is said to be a WESC with parameters (N, m, K, d, rj, B t ) for some d G (0, vj), if 

1) c g m mxN , 

2) Il v i|l2 = Vi f° r all i = 1, • • • ,N, and, 

3) if the following minimum distance property holds: 

d E (C,K,B t ) := min ||Cbi - Cb 2 || 2 > d 

bi^b 2 

for all bb b 2 G Bk- 
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Henceforth, we focus our attention on WESCs with rj = 1, and denote the set of parameters of interest by 

(N,m,K,d, B t ). 

The definition above can be extended to hold for other normed spaces. 

Definition 2: A code C is said to be an / P -WSC with parameters (N,m, K, d, B t ) if 

1) Ce R mxN , 

2) ll v «ll/ p = 1> f° r alH = 1, • • • , N, and, 

3) if the following minimum distance property holds: 

d p (C,K,B t ):= min ||Cbi - Cb 2 ||j >d 

t>i^b 2 p 

for all bi, b 2 G Bk- 
Note that specializing p = 2 reproduces the definition of a WESC. 

Motivated by the practical applications described in the previous section, we also define the class of nonnegative 
Zp-WSC. 

Definition 3: A code C is said to be a nonnegative Z P -WSC with parameters (N, m, K, d, Bt) if it is an Z P -WSC 
such that all entries of C are nonnegative. 

Given the parameters m, K, d and B t , let N (m, K, d, B t ) denote the maximum size of a WSC, 

N (m, K, d, B t ) := max {N : C (N, m, K, d,B t )^<t>}. 

The asymptotic code exponent is defined as 

R ,„, R , log N(m,K,d,B t ) 
R (K, d, Bt) := hm sup . 

m— >oo Tfl 

We are interested in quantifying the asymptotic code exponent of WSCs, and in particular, WESCs and nonneg- 
ative WSCs with p = 1. Results pertaining to these classes of codes are summarized in the next section. 

IV. On the Cardinality of WSC Families 

The central problem of this paper is to determine the existence of a superimposed code with certain parameters. 
In [2], [3], it was shown that for ESCs, for which the codeword alphabet B t is replaced by the asymmetric set 
{0, 1}, one has 

^ (1 + o d (1)) < R (K, d,{l}) < ^ (1 + o d (1)) , 

where o<j(l) converges to zero as K — > oo. 

The main result of the paper is the upper and lower bounds on the asymptotic code exponents of several WSC 
families. For WESCs, introducing weighting coefficients larger than one does not change the asymptotic order of 
the code exponent. 

Theorem 1: Let t be a fixed parameter. For sufficiently large K, the asymptotic code exponent of WESCs can 
be bounded as 

^£ (1 + o (1)) < R (K, d, B t ) < ^ (1 + o t4 (1)) (1) 

where o (1) — ► and o t .d (1) — > as K — > oo. The exact expressions of the o (1) and o t) d (1) terms are given in 
Equations ( fT9l ) and (0, respectively. 

Remark 1: The derivations leading to the expressions in Theorem Q] show that one can also bound the code 
exponent in a non-asymptotic regime. Unfortunately, those expressions are too complicated for practical use. 
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Nevertheless, this observation implies that the results pertaining to WESC are applicable for the same parameter 
regions as those arising in the context of CS theory. 

Remark 2: The parameter t can also be allowed to increase with K. For WESCs, the value of t does not affect 
the lower bound on the asymptotic code exponent, while the upper bound is valid as long as t = o (K). 

For clarity of exposition, the proof of the lower bound is postponed to Section |VIJ while the proof of the upper 
bound, along with the proofs of the upper bounds for other WSC families, are presented in Section [V] We briefly 
sketch the main steps of the proofs in the discussion that follows. 

The proof of the upper bound is based on the sphere packing argument. The classical sphere packing argument 
is valid for all WSC families discussed in this paper. The leading term of the resulting upper bound is (log K) /K. 
This result can be improved when restricting one's attention to the Euclidean norm. The key idea is to show 
that most points of the form Cb lie in a ball of radius significantly smaller than the one derived by the classic 
sphere packing argument. The leading term of the upper bound can in this case be improved from (log K) /K to 
{\ogK)/(2K). 

The lower bound in Theorem Q] is proved by random coding arguments. We first randomly generate a family of 
WESCs from the Gaussian ensemble, with the code rates satisfying 

logiV logK 
hm < — — - (1 + o(l)j . 

(m,N)-*oo m 4K 

Then we prove that these randomly generated codebooks satisfy 

d E (C,K) > d 

with high probability. This fact implies that the asymptotic code exponent 

log N (m, K, d,B t ) 



R (K, d,B t ) = lim sup 



in 



log K . , , , 

-^T (1 + o(1)) - 

We also analyze two more classes of WSCs: the class of general Zi-WSCs and the family of nonnegative Zi-WSCs. 
The characterization of the asymptotic code rates of these codes is given in Theorems |2] and |3j respectively. 

Theorem 2: For a fixed value of the parameter t and sufficiently large K, the asymptotic code exponent of 
Zi-WSCs is bounded as 

^ (1 + o (1)) < R (K, d, Bt) < ^ (1 + o t4 (1)) , (2) 

where the expressions for o (1) and o t ^ (1) are given in Equations (f3TT > and ©, respectively. 

Proof: The lower bound is proved in Section IVIIi while the upper bound is proved in Section [V] ■ 

Theorem 3: For a fixed value of the parameter t and sufficiently large K, the asymptotic code exponent of 
nonnegative Zi-WSCs is bounded as 

^ (1 + ot (1)) < R (K, d, B t ) < (1 + o t>d (1)) , (3) 

where the expressions for o% (1) and ot,d (1) are given by Equations (l40l) and (O, respectively. 

Proof: The lower and upper bounds are proved in Sections IVIIII and [V] respectively. ■ 

Remark 3: The upper bounds in Equations © and Q also hold if one allows t to grow with K, so that t = o (K). 
The lower bound in Q for general Zi-WSCs does not depend on the value of t. However, the lower bound (f3]) for 
nonnegative Zi-WSCs requires that t = o (i^ 1 / 3 ) (see Equation (l40b for details). This difference in the convergence 
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regime of the two Zi-WSCs is a consequence of the use of different proof techniques. For the proof of the rate regime 
of general Zi-WSCs, Gaussian codebooks were used. On the other hand, for nonnegative Zi-WSCs, the analysis is 
complicated by the fact that one has to analyze linear combinations of nonnegative random variables. To overcome 
this difficulty, we used the Central Limit Theorem and Berry-Essen type of distribution approximations [14]. The 
obtained results depend on the value of t. 

Remark 4: The upper bound for WESCs is roughly one half of the corresponding bound for /i-WSCs. This 
improvement in the code exponent of WESCs rests on the fact that the ^-norm of a vector can be expressed as 
an inner product, i.e. ||v|| 2 = v T v (in other words, <2 is a Hilbert space). Other normed spaces considered in the 
paper lack this property, and at the present, we are not able to improve the upper bounds for / p -WSCs with p 7^ 2. 



V. Proof of the Upper Bounds by Sphere packing Arguments 

It is straightforward to apply the sphere packing argument to upper bound the code exponents of WSCs. Regard 
an /p-WSC with arbitrary p £ Z + . The superposition Cb satisfies 

l|b|| 

IICblL < V IWi-bi W < Kt 

II lip — / j II h h lip — 
j = l 

for all b such that ||b|| < K, where the 6j 3 s, 1 < j < ||b|| < K, denote the nonzero entries of b. Note that the 
l p distance of any two superpositions is required to be at least d. The size of the / P -WSC codebook, N, satisfies 
the sphere packing bound 

A simple algebraic manipulation of the above equation shows 



so that one has 



< 1 i og ( 1 + ™i\ _ log(2t) _ Mj^j) 
K \ d J m m 



logK \ fit 1 
+ lo; 



logiV 



m 



K K \d K 

log(2t) log(^-^) 
m m 

The asymptotic code exponent is therefore upper bounded by 

\ogK 
K 

where 



(l + o M (l)), (5) 



OM(l) = 1 ° g 1 (f ^ U -^°0 (6) 
log K 

if t = o(K). 

This sphere packing bound can be significantly improved when considering the Euclidean norm. The result is an 
upper bound with the leading term (log if) / (2K). The proof is a generalization of the ideas used by Fiiredi and 
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Ruszinko in [3]: most points of the form Cb lie in a ball with radius smaller than 1), and therefore the 

right hand side of the classic sphere packing bound (f5]) can be reduced by a factor of two. 
To proceed, we assign to every b £ Bk the probability 

1 _ 1 

i^T~£f=i (!) (2* + i) fc ' 

For a given codeword matrix C, define a random variable 

£=l|Cb|| 2 . 

We shall upper bound the probability Pr{£ > for arbitrary A,/iG M + , via Markov's inequality 

We calculate E [£ 2 ] as follows. For a given vector b, let / C [1,VY] be its support set - i.e., the set of indices 
for which the entries of b are nonzero. Let hj be the vector composed of the nonzero entries of b. Furthermore, 
define 



B t)k = (B t \{0}) k - 



Then, 



E 



e 



k=l \I\=k bieB t ,A 



where ij E I, j = 1, - ■ ■ ,k. Note that 

E E 



|/|=fc hj&B t , k \j=l l<l¥=3<k 

E EK+E E E m,v*v< 

|/|=fc b,eB a i=i |/|=fc b,eB tlt i<lft<k 



(*) (**) 

It is straightforward to evaluate the two sums in the above expression in closed form: 



) e hi=ciy e »i 

\(2t) fe - 1 £ bl 

/ J, /- D 



Jfe(2t) 



/; _ 1 t(i + l)(2i + l) > 
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and 



(**) 



(?)<*>*-' E E vU 



= 0, 



where the last equality follows from the observation that 



^ ^ v,: v, 



+ E 

0. 



6i,>0,6 i;/ eBt,i 



Consequently, one has 



E E 



J'=l 



so that 



E[£ 



N\ (2t) k k (t + l)(2t + l) 
fc / 6 



S^K (N\ (2t) k k(t+l)(2t+l) 
21 _ <Efc=l \ k) 6 



Ef =1 (?) m k 

Next, substitute E [£ 2 ] into Markov's inequality, with 
so that for any A > 1, it holds that 

Pr(£>A^)<i. 

This result implies that at least a (1 — 1/ A) -fraction of all possible Cb vectors lie within an m-dimensional ball 
of radius A/Lt around the origin. As a result, one obtains a sphere packing bound of the form 

A/x+f \ 



1-^1 \Bk\ < 



Note that 



and that 



» 2 = E[e]<j (t+i) 



\B K \> 



N-K\ K ,„., K 



K 



(2ty 
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Consequently, one has 

1\ (N-K\ K K ( \Vk(t + l) 



i - t -^r- m < i + 



A/ V K 7 \ d 

or, equivalently, 

logiV logif 1 /A(t + 1) , 1 

^ + T7 log h 



m ~ IK K ° V d v 7 ^ 



M 1 -*) Mog^ 1 ^ log(2t) 



rai( m \K N J m 

Without loss of generality, we choose A = 2. The asymptotic code exponent is therefore upper bounded by 

log if 



2K 



where 



if i = o (iiT). This proves the upper bound of Theorem [T] 

VI. Proof of the Lower Bound for WESCs 

Similarly as for the case of compressive sensing matrix design, we show that standard Gaussian random matrices, 
with appropriate scaling, can be used as codebooks of WESCs. Let H G IR mxAr be a standard Gaussian random 
matrix, and let hj denote the j th column of H. Let Vj = hj/ \\h.j\\ 2 and C = [v% ■ ■ ■ ~vn}- Then C is a codebook 
with unit /2-norm codewords. Now choose a 6 > such that d (1 + 5) < 1. Let 

N 

(8) 



be the event that the normalized Z 2 -norms of all the columns of H concentrate around one. Let 

E 2 = |J {H: ||C(bi-b 2 )|| 2 >d}. (9) 

In other words, E 2 denotes the event that any two different superpositions of codewords lie at Euclidean distance 
at least d from each other. In the following, we show that for any 

*<^(i + .(i». 



for which o (1) is given by Equation ( fl9l ), if 



logiV 

lim — — < R, (10) 

(m,iV)-»oo m 



then 



lim Pr(E 2 ) = l. (11) 

(m,N)— -+00 



This will establish the lower bound of Theorem Q] 
Note that 

Pr (E 2 ) > Pr (E 2 p| Eh) = Pr (Eh.) - Pr (e x f] E 
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According to Theorem HI stated and proved in the next subsection, one has 



Thus, the desired relation (TTTb holds if 



Observe that 



where 



and 



lim Pr(£7i) = l. 

(m,N)— >oo 



lim Pr ( Ei H ES ) = 0. 

(m,AT)-»oo V 1 1 V 



C (bi - b 2 ) 



Hb', 



m 



b' := A H (b x - b 2 ) 
raj ||hi || 2 



aM/ ll h Ar|| 2 

By Theorem [19] in Section |VI-B[ in the asymptotic domain of ( fTOb , 



Pr^n{ H: ^H((l + J)b')<d(l + 5)}) 

=Pr (^n{ H: ^ Hb/ ^}) 

= Pr( J E 1 f > |{H: C(b x -b 2 )<d}) 



0. 



This establishes the lower bound of Theorem Q] 



(12) 



(13) 



A. Column Norms of H 

In this subsection, we quantify the rate regime in which the Euclidean norms of all columns of H, when properly 
normalized, are concentrated around the value one with high probability. 

Theorem 4: Let H 6 ]^ mxN be a standard Gaussian random matrix, and hj be the j th column of H. 
1) For a given S £ (0, 1), 

Pr 



1 II V. 11=2 1 

— h i 2 ~ 1 

m 



>5 ) < 2exp (- j<5 2 



for all l<j<N. 
2) If m, N —> oo simultaneously, so that 



1 5 2 
lim — log N < — , 

(m,N)^oom 4 



then it holds that 



(m 



lim Pr f I I { 



-IIM2- 1 

m 



> 5 
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Proof: The first part of this theorem is proved by invoking large deviations techniques. Note that ||hj || 2 
Yli=i l-^i,i| 2 i s chi-square distributed. We have 



and 



Pi- 



(°) r 

< exp < — m ( a (1 + 5) — log E 



)} 



1 



(6) 



exp | — m I a (1 + 5) + - log (1 — 2a) 
exp {-^(5 -log (1 + 5))}, 



Pr 



< 1-5 



i=i 



(c) r 

< exp <^ m ( a (1 — (5) + log E 



1 



exp |m I a (1 — 5) — - log (1 + 2a) 
exp{-y(-log(l-5)-«5)} ; 



(14) 



where (a) and (c) hold for arbitrary a > 0, and (6) and (d) are obtained by specializing a to 
respectively. By observing that 

- log (1 - 5) - 5 > 5 - log (1 + 5) > 0, 



(15) 

1 8 „ nt i 1 S 

2 1+5 dnu 2 1-5' 



we arrive at 



Pr 



1 iiv, ll 2 i 
— h i 2 ~ 1 

m 



> 5 



<2exp{-^(<5-log(l + <5))} 



< 2 exp {- —S 2 

The second part of the claimed result is proved by applying the union bound, i.e. 



1 IIVi ||2 1 

— \\ h j\\ - 1 
m 



<2iYexp{-^,5 2 } 



> S 



) I I 1 i Ar^ lo § 2 
exp < — m — — log A/ H 

V 4 V »n m 



This completes the proof of Theorem 4. 



B. The Distance Between Two Different Superpositions 

This section is devoted to identifying the rate regime in which any pair of different superpositions is at sufficiently 
large Euclidean distance. The main result is presented in Theorem [6] at the end of this subsection. Since the proof 
of this theorem is rather technical, and since it involves complicated notation, we first prove a simplified version 
of the result, stated in Theorem [5] 
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Theorem 5: Let H € W nxN be a standard Gaussian random matrix, and let 5 6 (0, 1) be fixed. For sufficiently 
large K, if 

r log iV log A' 

hm < — — - (1 + o s (1)) 

(m,N)^oo m 4K 

where the exact expression for os (1) given by Equation (TT8T ). then 



lim Pr 



1 



rn 



\Hb\\l < 5 2 



0, 



for all h£Bg such that ||b|| < 2K. 

Proof: By the union bound, we have 

Pr I |J 

b|L<2iC 



H : — [iHbllo < 5 2 

m 



pr (u U 

\k=l ||b|| =fe v 



|Hb||a < 5 2 



2K 



< 



E(fc) (4t) fc Prfl||Hb||^<^ ||bL-/, 



k=l 



m 



We shall upper bound the probability 



Pr 



m 



|Hbl|2 < 5 , ||b|| 



for each fc = 1, • • • , 2 K. From Chernoff 's inequality, for all a > 0, it holds that 



Pr ( — HHbllo < 5 2 , ||b|| 
< exp jm ^a<5 2 + log E < 



,-a(H 4 ,.b) 2 



)}• 



where Hj . is the i th row of the H matrix. Furthermore, 



E 



-a(H 4 ,.b) 2 



E 



(a) 

< E 



exp{-a||b|| 2 (H v (b/||b|| 2 )) 2 } 
exp{-ak(Hi,.(b/\\b\\ 2 )) 2 } 



(b) l 

< --log(l + 2afc) 



(16) 



(17) 



where (a) follows from the fact that ||b|| 2 > k for all b € B^ t such that ||b|| = k, and where (b) holds because 
Hj r (b/ ||b|| 2 ) is a standard Gaussian random variable. Let 



a 



1 k-6 2 
2k S 2 



1 1 



13 



Then 



Pv^\\nh\\l<5 2 , \\b\\ = k 

<ex P {m(Q-^)-ilogA 
= exp l Yl + ~fc ' 

Substituting the above expression into the union bound gives 

H : — ||Hb|| 2 < 5 2 

m 

log k — log 5 + — 




k=l 



<^exp j-m/c + 



— logiV log (4t) 

m m 



log 5 2 //c — log 5 2 — 1 



2 A; 



log iV — — log (4i) 

m m 



Now, let if be sufficiently large so that 



log 2K 5 2 /(2K) -log5 2 - 1 
AK + Ik 

/log A; <5 2 /£;-log<5 2 - 1 
= min — 1 

i<k<2K \ 2k 2k 

If 

log JV log* 
hm < (1 + os (1)) , 

(m,iV)^oo m AK 

where 

m log2 + ^/(2K)-log^-l 
° 5(1) = togK ' (18) 

then 

lim Pr [ — ||Hb|| 2 < 5 2 ) = 



for all b G 5^ such that ||b|| < 2K. This completes the proof of the claimed result. ■ 

Based on Theorem [5j the asymptotic region in which Pr (E%f) E\) — > is characterized in below. 

Theorem 6: Let H S M mxAr be a standard Gaussian random matrix, and for a given H, let Ah be as defined in 
(fT3T ). For a given d G (0, 1), choose a 5 > such that d(l + 6) < 1. Define the set £i as in ([8]). For sufficiently 
large K, if 

lim ^<^(l + o( l)) 

(m,N)-*oo m AK 

where 

°M = ^. (W) 

log K 



14 



then 

' ' ' (20) 



lim Pr (- ||HA H (bi - h 2 )\\ 2 2 < d 2 , Ex) = 0, 



(m, 

for all pairs of h>i, b 2 € Bk such that bi / h 2 . 

Proof: The proof is analogous to that of Theorem 0] with minor changes. Let b' = Ah (bi — b2). On the set 
E\, since 

"h,-|L<l + 5 



m " 3 2 



for all 1 < j < N, the nonzero entries of (1 + S) h' satisfy 

\(l + 5)b'\>l. 

Replace b in Theorem [5] with (1 + 5) b'. All the arguments in the proof of Theorem [5] are still valid, except that 
the higher order term is changed to 

d(l+8) (1) 

log 2 + d 2 (1 + 5) 2 j (2K) - 1 - log (d 2 (1 + 5)' 



log if 

log2-l 
- logif ' 

This completes the proof of the theorem. ■ 

VII. Proof of the Lower Bound for Zi-WSCs 

The proof is similar to that of the lower bound for WESCs. Let A £ W nxN be a standard Gaussian random 
matrix, and let H be the matrix with entries 



H U = 1 - * - m ' 1 - 3 - N - 

Once more, let hj be the j th column of H. Let Vj = hj/ [|hj r [| and C = [vi • • • Vjv]. Then C is a codebook with 
unit Zi-norm codewords. Now choose a 5 > such that d (1 + S) < 1. Let 

* r i i 
£i = Ui H: -IIMie(i-M + ^)L (2i) 

and 

E 2 = [j {H: ||C(bi -b 2 )Hi > d}. (22) 
We consider the asymptotic regime where 

lim 1 ^<R, 

(m,N)-*oo 171 

*<^<i + .(i», 

and Od (1) is given in Equation (|3"TT) . Theorem [7] in Section IVII-AI suggests that 

lim Pr(£i) = l, 

(m,N)— +oo 



15 



while Theorem [8] in Section IVII-BI shows that 



lim Pr [ Ei Pi & 

(m,iV)-KX) V ' 



0. 



Therefore, 



(m,N)—*oo (m,N)—xx 

This result implies the lower bound of Theorem [2] 



lim Pr(£ 2 )> lim Pr (EA - Pr (e x ClEg) = 1 

ATWoc fm.ATWoo V 1 1 / 



A. Column Norms of H 

The following theorem quantifies the rate regime in which the l\ -norms of all columns of H, with proper 
normalization, are concentrated around one with high probability. 

Theorem 7: Let A G W nxN be a standard Gaussian random matrix. Let H be the matrix with entries 

V2n 



H, 



Aij, l<i<m, 1 < j < N. 



Let hj be the j column of H. 
1) For a given 6 G (0, 1), 



Pr 



for some positive constant c\ and c 2 ; 
2) Let m, N —>■ oo simultaneously, with 



-IIMi-i 

m 



1 



> 8 I < c\e 



-mc 2 S 2 



lim — logiV < c 2 S 2 . 

(m,N)^oom 



The it holds that 



(r 



lim Pr ( I J { 



rn 



> 5 



0. 



Proof: 

1) Since A^j is a standard Gaussian random variable, \Ai j\ is a Subgaussian distributed random variable, and 
E = -^=. According to Proposition Q] in Appendix lAl — is a Subgaussian random variable 

with zero mean. A direct application of Theorem [12] stated in Appendix [A] gives 

1 



Pr 



in 



mili-1 



> S 



- p AU lA '" 1 

< ci exp (— C2m5 2 ) 



2m5\ 



2vr I 



which proves claim 1). 
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2) This part is proved by using the union bound: first, note that 



If 



then one has 



m 



"•Jill 



> 8 



< exp (— mc2<5 + log c\ + log iV) 

= exp < — m ( C2^ 2 log ci log TV 

V m m 



lim — logiV < c 2 <5 2 , 

(m,N)^oom 



(m 



lim Pr f I I / 



-IIMl-1 

m 



> 5 



This completes the proof of claim 2). 



B. The Distance Between Two Different Superpositions 

Similarly to the analysis performed for WESCs, we start with a proof of a simplified version of the result needed 
in order to simplify tedious notation. We then explain how to establish the proof of Theorem [9] by modifying some 
of the steps of the simplified theorem. 

Theorem 8: Let A € flj mxAr a standard Gaussian random matrix. Let H be the matrix with entries 

/2tt 



H i,j 



Aij, l<i<m, l<j<N. 



Let 5 € (0, 1) be given. For sufficiently large K, if 

log N log K 



lim 



< 



where 



then 



(m,N)^oo m AK 

2 



7T 



logK V ° 2<J 



1 



lim Pr — I Hbl , < S 

(m,JV)-+oo \ m 

for all be5 2 ^ such that ||b|| < 2K. 

Proof: The proof starts by using the union bound, as 

1 



Pr 



|b|| <2^ 



sEQw'P'^liHbiusi.iibi^*) 



To estimate the above upper bound, we have to upper bound the probability 



Pr 



1 



m 



|Hb|| x < 5, ||b| 



(23) 
(24) 



(25) 
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for each k = 1, • • ■ , 2K. Let us derive next an expression for such an upper bound that holds for arbitrary values 
of k > 1. 



Note that 



E 



(a) 



-a|E?=iM«,i 



2vr IITol 



-e ^e -01 ■ da 



o \/2t7 

2|lhl|2 /"OO 2 



g °2 g «ll b ll 2 X ' . (Ix 



e 2 



(x + a llbl 







exp 



dx 



((,) c 2 ||b|| 2 z" 00 

= e 2 



< e 2 
1 



2 / x- 

^= ex P 

a||b|| 2 V27T 

2 nhii 2 r°° ^ 2 



• dx 



x 



a||b|| 2 a Il b ll2 V27T 



3; 



exp — — • dx 



-.2 ii v. II 2 „ ,2 I iv. ii 2 



a llbl 



2vr 



e 2 e 



( c ) 1 2 

< 



(26) 



a V27rA;' 

where (a) and (6) follow from the change of variables x' = xj ||b|| 2 and x' = x + a ||b|| 2 , respectively. Inequality 
(c) holds based on the assumption that ||b|| 2 > k. As a result, 



i=l 

-t 

m. ^— ' 



< 5 



3=1 

25 

< exp ^ m ( a / + log E 

27T 



< 



< exp \ m [ a + log 

V 27T 



2<5 



exp < m a 



log a 



2?r 

g-«|E 3 b 3 H l\ 

2 1 

\/2ttA; a 
nk 




exp jm ^1 — log {y~k— 



25 

where the last equality is obtained by specializing a = y/2ir/2 5. 

The upper bound in (|2"71) is useful only when it is less than one, or equivalently, 

log (ykJL) > i. 

For any 5 € (0, 1), if k > 4, inequality (1281 ) holds. Thus, for any fc > 4, 

©^ Pr (^ l|Hb|11 ^' l|b " = fc ) 
<exp|-^f^(l + o 5 (l)) l0g ^ l0giV 



(27) 



(28) 



2k 



m 



m 



(29) 
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as (m, N) — ► oo with 



log AT log A; 
um <^— (1 + 05(1)) 



where 



(m,JV)— >oo 7Tt 



05(1) 



2A- 



2 'log^-M. 



log fc V 2(5 

Another upper bound is needed for k = 1, 2, 3. For a fixed taking one of these values, 

Pr ( — HHbH-L < 5 



in 



Pr 



m z — ' 



< 



'2tt 
2||b|| 



2m ft 111-11 \ 



It is straightforward to verify that ^ ■ Aijbj is Gaussian and that 

2||b| 



Thus 



E 



2||b| 



2tt 



Vk,V2kt 



and therefore, 



is a sum of independent zero-mean subgaussian random variables. Furthermore, ||b|| 2 € 
5 — ||b|| 2 < 0. Hence, we can apply Theorem [T2l of Appendix lAl as a result, there exist positive constants c 3j fc and 
C4 fc such that 

Pr f — llHblL < 5 

< c 3ifc exp (-c 4;fc m (5 - ||b|| 2 ) 

< c 3jfc exp ^-c 4jfc m (Viz - \ . 
Note that the values of c 3i fc and c^k depend on k. Consequently, 

^WprfillHbll^Mbllo- 



< c 3ifc exp { -mk \ c 4jfc ( 1 - — 



log (At) log TV 



m 



m 



(30) 



as m, N —>■ 00 with 



logJV 
lim < C4 
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Finally, substitute the upper bounds of d29b and d30l ) into the union bound of Equation ( T25T ). If K is large enough 
so that 

logiC 



and if 



iK (l + 0{ (i)) <C4>fc ^i_ — ] for all A; = 1,2, 3, 



(l + o 5 (l))< mm ——(1 + 05(1)) 



4K — 4<fc<2K" 2/c 

where 05 (1) is as given in ( T23T ), then the desired result (1241) holds. 

Based on Theorem [U we are ready to characterize the asymptotic region in which Pr (E^f) E\) — > 0. 
Theorem 9: Define A and H as in Theorem [9] For a given H, define the diagonal matrix 

raj ||hi 1^ 

Ah = 

mj ||hjv||i 

For a given d £ (0, 1), choose a 5 > such that d (1 + 8) < 1. Define the set E\ as in ((2T|) . For sufficiently large 

log iV log X 



K, if 



lim 



where 



then 



2 



< 



(l + o(l)), 



o(l 



logiC 



(l0g7T-l-l0g2), 



(31) 



(m 



lim Pr(— HHAHCbi-ba)^ < d, £1 ) =0 



for all pairs of h>i, b2 G Bk such that bi / b2- 

Proof: Let b' = Ah (bi — b 2 ). On the set E\, since 

1 



Mi < 1 + 5, 



for all 1 < j < N, all the nonzero entries of (1 + 5) h' satisfy 

1(1 + 5)6',! >1. 

Replace b in Theorem [8] with (1 + 5) h'. All arguments used in the proof of Theorem [8] are still valid, except that 
now, the higher order term (l23l ) in the asymptotic expression reads as 

Od(i+s) (1) 

= (logvr - 1 - log 2 - log(d(l + 6))) 

log K 

> r ^-(l0g7T-l-l0g2). 

log K 



This completes the proof. 



VIII. Proof of the Lower Bound for Nonnegative /i-WSCs 

The proof follows along the same lines as the one described for Zi-WSCs. However, there is a serious technical 
difficulty associated with the analysis of nonnegative Zi-WSCs. Let A £ i mxiV be a standard Gaussian random 
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matrix. For general Zi-WSCs, we let 



H i,i = ^T A hi> l<i<m, l<j <N, 

and therefore, 

is a Gaussian random variable, whose parameters are easy to determine. However, for nonnegative Zi-WSCs, one 
has to set 



H i,j = |j4iJ '' ' 1 - * - m ' 1 - j ~ N ' (32) 



Since the random variables Hijs are not Gaussian, but rather one-sided Gaussian, 

is not Gaussian distributed, and it is complicated to exactly characterize its properties. 

Nevertheless, we can still define E\ and E2 as in Equations (T2TT) and ((22l) . The results of Theorem |7] are still 
valid under the non-negativity assumption: the norms of all H columns concentrate around one in the asymptotic 
regime described in Theorem [7] The key step in the proof of the lower bound is to identify the asymptotic region 
in which any two different superpositions are sufficiently separated in terms of the l\ -distance. We therefore use 
an approach similar to the one we invoked twice before: we first prove a simplified version of the claim, and then 
proceed with proving the needed result by introducing some auxiliary variables and notation. 

Theorem 10: Let A G ]^ mxAr be a standard Gaussian random matrix. Let H be the matrix with entries 



H ij = IAjI , l<i<™, l<j <N. 

Let 5 G (0, 1) be given. For a given sufficiently large K, if 

logiV logK 

< feP < ~nr ( x + °* W) 

(m,Af)^oo m 4A 

where o t (1) is given in d39l ), then 

lim Pr f — llHbL < 8 ) =0 (33) 

(m,JV)->cx) \ m / 

for all beB£ and ||b|| < 2K. 

Proof: Similarly as for the corresponding proof for general Zi-WSCs, we need a tight upper bound on the 
moment generation function of the random variable 

k 

^ \Ai t j\ 

For this purpose, we resort to the use of the Central Limit Theorem. We first approximate the distribution of 
J2j=i bj \ by a Gaussian distribution. Then, we uniformly upper bound the approximation error according to 
the Berry-Esseen Theorem (see [14] and Appendix IB1 for an overview of this theory). Based on this approximation, 
we obtain an upper bound on the moment generating function, with leading term (log k) / \fk (see Equation (l38l) 
for details). 
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To simplify the notation, for a b G B^ t with 1 1 Too 1 1 = k, let 



N 



Y h>k = ^2 — \Aj\bj, 



where Ajs are standard Gaussian random variables. Then, 

Pr ( — ||Hb||-, < 5, ||b| 
V m 



< expjm (a5 + logE e _a|Yb fc| )} 



where the inequality holds for all a > 0. Now, we fix a and upper bound the moment generating function as 
follows. Note that 



E 



-a[*b,k| 



E 

+ E 



e -a|n.»| |y bfc | >-\ og ^fk 
a 

q| M Hb.fcl < -log^ 



Q: 



(34) 
(35) 



The first term (1341 ) is upper bounded by 



E 



° logv/ ", \Y h _ k \>-lo R Vk 



< E 



|*b,fc| > ^log\/A; 



< -LprMF b)fc | > ^log^) 



< 



1 



Vk' 



(36) 



In order to upper bound the second term in Equation 051 ), we apply Lemma |2] from the Appendix, proved using 
the Central Limit Theorem and the Berry-Esseen result: 



E 



1, \Yh,k\ < -logVk 
a 



Pr ( |lb.fcl < -logVk 
a 



Pr 



h,k\ 

k 



i=i 



< ; — log Vk 



2tt a 



2 1 logVk t 3 „ 

< -r= ^7= h 12— ^E 

V27T avr VA; V k 



1 1 



log k 



Vk V2tt V a7r 
Combining the upper bounds in (l36l ) and (I37T ) shows that 



+ 48i c 



(37) 



E 



-a|>b,fc| 



1 



1 / log 



2tt \ an 



+ 48t c 



<_ I 1 + ^ + 24^ 
V% V 4a 



(38) 
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Next, set a = 1/5. Then 



Pr 



??? 



iHblh < S, ||b| 



< exp <J -m ( - log k ) (1 + o t)fe (1)) 



where 



°t,& (!) 



2 + 21og(l + ^ + 24f 



log k 



Now we choose a ko £ Z + such that for all k > ko, 

log k 



2k 



1 + 0^(1)) >0. 



It is straightforward to verify that k$ is well defined. Consider the case when 1 < k < ko. It can be verified that 



is Subgaussian and that 



E 



'J 



> 1 



for all b G i?^ such that ||b|| = k. By applying the large deviations result for Subgaussian random variables, as 
stated in Theorem [El and the union bound, it can be proved that there exists a > such that 



Pr 

< exp ^ — ink ( r/, 
0. 



log (4i) log JV 



m 



/7? 



The above result holds whenever m, iV — » oo simultaneously, with 



Finally, let K be sufficiently large so that 

logiC 



logiV 
Iim < Cfc. 

(m,iV)^oo m 



log k 



and 



(1 + o t>2K (1)) < ; min -g- (1 + o t)fe (1)) , 

ko<k<2K Zfc 



(1 + °t,2X I 1 )) < mi?, c fc- 
4A l<fc<fc 
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Then 



\ll b llo<2^ ^ V 

^E(I) (^(-llHbll^*, ||b|| = * 

< exp < — mk ( Cfc 
k=i ^ ^ 



log (At) log N 



k=k„+l * 



log (At) log N 



m 



m 



as m, N —* oo with 



where 



log TV logiC 
hm < -^r- (1 + ot (1)) , 



(m,7V)^oo m 



4K" 



2 + 21og(^l + i^ + 24t 3 
log(2iC) 



(39) 



Based on Theorem [TOl we can characterize the rate region in which any two distinct superpositions are sufficiently 
separated in the l\ space. 

Theorem 11: Define A and H as in Theorem [TOl For a given H, define the diagonal matrix 



mj ||hi|L 







m/||hjv||i 

Also, for d G (0, 1), choose a 5 G (0, |) such that d (1 + 5) < 1. Define the set £i as in (ED. Provided that if is 
sufficiently large, if 



log JV log If 
hm < — — (1 + o t (1)) 



4K 



where 



then it holds 



ot(l) 



2 + 21og (l + + 648t J 

1) = 

log(2K) 



(40) 



(m 



lim Pr f 1 ||HA H (bi - b 2 )|| 1 < d, E x ) = 0, 

,A r )-»oo \m / 



for all pairs of bi, b 2 G £>jf such that bi / b 2 . 

Proof: The proof is very similar to that of Theorem |T0j The only difference is the following. Let b' 
Ah (bi — b 2 ). Since 

-<l-6< — WhAL <l + 6<~, 
2 - - II Jin - - 2 > 
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all the nonzero entries of (1 + S) b' on the set E\ satisfy the following inequality 

1 < |(l + -5)6-| < 3t. 

As a result, we have the higher order term o t (1) as given in Equation (l40l ). ■ 

IX. Conclusions 

We introduced a new family of codes over the reals, termed weighted superimposed codes. Weighted superimposed 
codes can be applied to all problems in which one seeks to robustly distinguish between bounded integer valued 
linear combinations of codewords that obey predefined norm and sign constraints. As such, they can be seen as a 
special instant of compressed sensing schemes in which the sparse sensing vectors contain entries from a symmetric, 
bounded set of integers. We characterized the achievable rate regions of three classes of weighted superimposed 
codes, for which the codewords obey I2, h, and non-negativity constraints. 

Appendix 

A. Subgaussian Random Variables 

Definition 4 (The Subgaussian and Sub exponential distributions): A random variable X is said to be Subgaus- 
sian if there exist positive constants c\ and C2 such that 

Ft(\X\ > x) < Cl e- C2x2 Vx>0. 

It is Subexponential if there exist positive constants c\ and C2 such that 

Pr(|X| > x) < c ie - C2X Vx>0. 

Lemma 1 (Moment Generating Function): Let X be a zero-mean random variable. Then, the following two 
statements are equivalent. 

1) X is Subgaussian. 

2) 3c such that E [e aX ] < c ca \ Ma > 0. 

Theorem 12: Let X\,--- ,X n be independent Subgaussian random variables with zero mean. For any given 
01, • • • ,a„£l, akXk is a Subgaussian random variable. Furthermore, there exist positive constants c\ and C2 
such that 



Pr 



-2, 



> x < Cl e- C2X /l|a|| 2, Vx > 



where ||a||£ = 

Proof: See [15, Lecture 5, Theorem 5 and Corollary 6]. ■ 

We prove next a result that asserts that translating a Subgaussian random variable produces another Subgaussian 
random variable. 

Proposition 1: Let X be a Subgaussian random variable. For any given aeR, Y = X + a is a Subgaussian 
random variable as well. 

Proof: It can be verified that for any j/gR, 

{y-af < \y 2 -a 2 , 

and 

{y + af < \y 2 -a 2 . 
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Now for y > \a\, 

Pr > y) = Pr (X + a > y) + Pr (X + a < -y) 

< Pr(X > y - a) +Pr (X < -y - a) . (41) 

When a > 0, 

()4Tj) < Pr(|X| > y-a) 

< cie- C2( ^ a)2 

< cic C2a2 e- C2i/2 / 2 . (42) 

When a < 0, 

(HO < Pr(|X| > y + a) 

< cie- C2fe+a)2 

< cic C2a2 e- C2y2/2 . (43) 
Combining Equations (|42l) and (l43l . one can show that 

Pr(|y| > y) < Cl c C2a2 e- C2y2/2 , Vy > \a\ . 

On the other hand, 

Pr(|F| < y) < 1 < e c ^ 2 /2 e -c 2 y 2 /2 ; Vy < _ 
Let c 3 = max (cie C2a2 , e Caa2/2 ) and c 4 = c 2 /2. Then 

Pr(|y| >y) <c 3 e- C4i/2 . 

This proves the claimed result. ■ 

B. The Berry-Esseen Theorem and Its Consequence 

The Central Limit Theorem (CLT) states that under certain conditions, an appropriately normalized sum of 
independent random variables converges weakly to the standard Gaussian distribution. The Berry-Esseen theorem 
quantifies the rate at which this convergence takes place. 

Theorem 13 (The Berry-Esseen Theorem): Let Xi, X2, ■ ■ ■ , Xk be independent random variables such that E [Xi] = 
0, E [Xf] = al E [\X?\] = Pi . Also, let 

2 2 1 1 2 

Sfc = 0"l + • • • + <7jfc, 

and 

ffc = Pi H h /Ofc- 

Denote by the cumulative distribution function of the normalized sum (X\ + • • • + XjS) /st, and by jV the 
standard Gaussian distribution. Then for all x and k, 

\F k (x)-jY{x)\ <6% 

The Berry-Esseen theorem is used in the proof of the lower bound for the achievable rate region of nonnegative 
Zi-WSCs. In the proof, one need to identify a tight bound on the probability of a weighted sum of nonnegative 
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random variables. The probability of this sum lying in a given interval can be estimated by the Berry-Esseen, as 
summarized in the following lemma. 

Lemma 2: Assume that b G B\ is such that ||b|| = k, let X\, X2, ■ ■ ■ , X^ be independent standard Gaussian 
random variables. For a given positive constant c > 0, one has 



Pr 



i=i 



— 1 c log Vk t 3 

< clogVk I < + 12p-=, 

7T Vk Vk 



where p := E 



X\ 



Proof: This lemma is proved by applying the Berry-Essen theorem. Note that the bj |X/|'s are independent 
random variables. Their sum 2~2j=i bj \Xj \ can be approximated by a Gaussian random variable with properly chosen 
mean and variance, according to the Central Limit Theorem. In the proof, we first use the Gaussian approximation 
to estimate the probability 



Pr 



I>l*il 

3=1 



< c 



log Vk 



Then we subsequently employ the Berry-Essen theorem to upper bound the approximation error. 
To simplify notation, let 

k 

Yb,k = \Xj\, 
3=1 

and let j¥ (x) denote, as before, the standard Gaussian distribution. Then, 

Pr(|Y bjfc | <clogVk) 



< Pr 



< Pr 



< Pr 



: log Vk c log Vk 



Y h , k 



Yh,k < clogVk 



c log Vk c log Vk 
Vk ' Vk 



Vk 



Pr 



Ibl 



< 



clog Vk 



Vk 



where in the second inequality we used the fact that bj > 1, so that Ylj=i > k. 
According to Theorem \13\ for all x € M and all k, 



Pr 



Ibl 



< x) — jV (x) 



< 



Qkpt 3 _ 6pt 3 



3/2 



since £)J =1 | & ^ | 3 < kt 3 , and }~2j=i \bj\ 2 > k. 
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Thus, 



Pr(|l r b,fc| <clog^) 




Vk 



2 clog\/fe 12pt 3 



which completes the proof of the claimed result. 
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